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Abstract 

Background: Chromosomal inversions are increasingly being recognized as important in adaptive shifts and are 
expected to influence patterns of genetic variation, but few studies have examined genetic patterns in inversion 
polymorphisms across and within populations. Here, we examine genetic variation at 20 microsatellite loci and the 
alcohol dehydrogenase gene [Adh) located within and near the ln{2L)t inversion of Drosophila melanogaster at three 
different sites along a latitudinal dine on the east coast of Australia. 

Results: We found significant genetic differentiation between the standard and inverted chromosomal 
arrangements at each site as well as significant, but smaller differences among sites in the same arrangement. 
Genetic differentiation between pairs of sites was higher for inverted chromosomes than standard chromosomes, 
while inverted chromosomes had lower levels of genetic variation even well away from inversion breakpoints. 
Bayesian clustering analysis provided evidence of genetic exchange between chromosomal arrangements at each 
site. 

Conclusions: The strong differentiation between arrangements and reduced variation in the inverted 
chromosomes are likely to reflect ongoing selection at multiple loci within the inverted region. They may also 
reflect lower effective population sizes of ln(2L)t chromosomes and colonization of Australia, although there was no 
consistent evidence of a recent bottleneck and simulations suggest that differences between arrangements would 
not persist unless rates of gene exchange between them were low. Genetic patterns therefore support the notion 
of selection and linkage disequilibrium contributing to inversion polymorphisms, although more work is needed to 
determine whether there are spatially varying targets of selection within this inversion. They also support the idea 
that the allelic content within an inversion can vary between geographic locations. 
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Background can be identified by examining the banding patterns of 

Chromosome inversions occur when a chromosome chromosomes in the larval salivary gland cells, and by 
breaks in two places and the segment between the the formation of loops during chromosomal pairing be- 
breakpoints is re-inserted in the reverse orientation. In tween inverted and standard arrangements, making 
Drosophila and other Diptera, inverted and noninverted them convenient genetic markers for studying evolution 
(standard) forms of chromosomes often coexist within [1,2]. Studies on inversion frequency changes in natural 
the same population. These inversion polymorphisms and laboratory populations of Drosophila by Dobzhansky 

and his colleagues provided early evidence that inversion 
polymorphisms are under strong selection and adapta- 
tion [3,4]. Since then evidence for selection and adapta- 
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numbers of species including plants [5], seaweed flies 
[6], butterflies [7], Anopheles mosquitoes [8], fruit flies 
[9] and humans [10]. Inversions are also thought to play 
a role in the evolution of sex chromosomes and sped- 
ation [1]. 

The spread and maintenance of inversion polymor- 
phisms is thought to be due to their impact on linkage 
disequilibrium. Inversions maintain associations between 
alleles because crossing over between inverted and 
standard arrangements gives rise to nonfunctional mei- 
otic products. Under the coadaptation hypothesis pro- 
posed by Dobzhansky, inversions have selective value 
because they hold together favourable combinations of 
alleles [3,11]. A crucial aspect of the hypothesis is that 
alleles at loci within the inversion have epistatic interac- 
tions that increase fitness. Heterosis and the idea that 
the allelic content of the inversion evolves after inver- 
sions arise are also assumed, leading to different alleles 
in populations within the same inversion [12,13]. 

An alternative hypothesis is that inversions have se- 
lective value because they bring together two or more al- 
leles that are adapted to local conditions [14]. With this 
model, no epistasis is needed for the inversion to gain a 
fitness advantage, so the mechanism can operate even 
when alleles are adapting to different environmental var- 
iables. It also does not require sets of alleles to become 
coadapted within a chromosomal arrangement in a 
population, or for the presence of heterosis. This mech- 
anism may therefore occur much more frequently than 
mechanisms involving coadaptation [2]. An inversion 
harbouring locally adapted alleles will go to fixation un- 
less a polymorphism is maintained by migration or bal- 
ancing selection. Other explanations why inversions 
spread through populations include direct selection on 
the inversion (rather than its effects on recombination) 
arising from a mutation at the breakpoints, underdomi- 
nance and overdominance [1,2]. 

Much of the empirical support for the idea that inver- 
sions are locally adapted comes from laboratory experi- 
ments on Drosophila reviewed in [11]. Several studies 
have shown that changes in inversion frequencies in 
population cages depend on when and where samples 
from natural populations were taken [15] and how they 
have been maintained [16,17]. More recently, Lowry and 
Willis [5] have used reciprocal transplant experiments 
involving outbred lines where inversion chromosomal 
arrangements were introgressed into different genetic 
backgrounds to demonstrate local adaptation in the yel- 
low monkey flower. 

Molecular studies provide further evidence that inver- 
sions evolve over time and are involved in local 
adaptation. Levels of linkage disequilibrium (LD) and 
nucleotide divergence between inverted and standard 
chromosome arrangement change over time, and 



become reduced towards the middle of the inversion 
where multiple crossover and gene conversion are 
expected to be higher see [1], although this is not always 
the case (e.g., [18]). Patterns of LD within inversions 
may also reflect selection as well as recombination and 
historical processes; in Drosophila pseudoobscura and 
Drosophila melanogaster, LD between genes within in- 
version decreases as they are situated further apart, but 
some nonadjacent genes maintain high LD with regions 
of low LD between them, suggesting selection at loci 
across the inverted region [13,19]. 

By contrast, molecular evidence for local adaptation is 
surprisingly scarce and inconsistent. Allozyme studies 
on D. pseudoobscura have shown that the same inver- 
sions from different populations have unique combina- 
tions of alleles [20,21]. However, samples sizes in these 
studies tended to be small [13] and nucleotide sequences 
of genes situated within inversions show no significant 
differences among populations within the same arrange- 
ments [13,22], In mosquitoes, patterns of nucleotide 
divergence between chromosomal arrangements are in- 
consistent with neutral models [23], but these patterns 
are not always found [24] and clear footprints of select- 
ive sweeps or balancing selection on genes within inver- 
sions are uncommon [23,24]. However, signatures of 
selection have been reported in genes within inversions 
in Drosophila (e.g., [18,25]). 

Here, we examine geographic variation in the In(2L)t 
inversion in populations of D. melanogaster along the 
east coast of Australia. This chromosome arrangement 
is located in the middle of the left arm on chromosome 
2 (breakpoints 22D3-E1 and 34A8-9) and has been the 
focus of many studies due its close proximity to the al- 
cohol dehydrogenase (Adh) locus. As found on other 
continents, both Adh and In(2L)t show latitudinal clines 
in eastern Australia, with higher frequencies of the Adh s 
allele and In(2L)t at lower latitudes, providing strong 
evidence that natural selection is maintaining the poly- 
morphisms [26,27]. 

If In(2L)t is locally adapted along eastern Australia, we 
would expect to see differences in the allelic content of 
inverted chromosomes between sites from different lati- 
tudes. Due to the interaction between recombination 
processes (gene conversion and double crossovers) and 
selection, we would also expect to see a mosaic of more 
and less differentiated regions between standard and 
inverted chromosomes [23] that differ among sites, as 
well as different patterns of LD within In(2L)t among 
sites. We test these predictions by examining microsatel- 
lite variation within standard and In(2L)t chromosomes 
at three different sites (Figure 1). Haplotypes were 
obtained by crossing isofemale lines to an isogenic strain 
and subtracting the allele present in the isogenic strain 
from the genotypes of the ¥ 1 progeny. Under the 



Kennington and Hoffmann BMC Evolutionary Biology 2013, 13:100 
http://www.biomedcentral.eom/1 471 -21 48/1 3/1 00 



Page 3 of 1 1 



c \ 



A 


10° s — 


•) Innisfail 


20° S — 


1 

Coffs Harbour •/ 


30° S — 


Melbourne / 






40° S — 


1000 km ( ^~^t 





Figure 1 Map of the eastern coast of Australia showing the 
sampling sites. 
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assumption that microsatellites are neutral, no differ- 
ences between chromosome arrangements or among the 
same arrangement from different sites are expected by 
selection on the microsatellite loci themselves. However, 
hitchhiking effects aided by extensive LD across the 
inverted regions, as well as mutations accumulating in 
chromosomes after hitchhiking events, may lead to 
genetic differences in the allelic content of In(2L)t along 
the cline. 

Results 

Marker variation 

Large differences in diversity were observed between 
markers. The number of alleles ranged from two to 23 
and gene diversities ranged from 0.01 to 0.83 (Table 1). 
All individuals from the isogenic strain were homozy- 
gous for the same allele at all markers and all F 2 individ- 
uals had a genotype that included at least one allele 
from the isogenic strain. There were marked differences 
in the frequencies of the Adh s allele and In(2L)t 
among sites. Frequencies of the Adh s allele ranged 
from 0.34 to 0.91 and In(2L)t from 0.04 to 0.29. These 
frequencies were very close to those reported by 
Umina et al. [28], suggesting the crossing scheme did 
not favour a particular Adh allele or chromosome ar- 
rangement (Figure 2). 

Genetic diversity tended to be lower in chromosomes 
with In(2L)t compared to those with the standard ar- 
rangement (Figure 3). Pairwise tests between chromo- 
some arrangements at each site revealed significantly 
higher allelic richness in the standard chromosomes 



Table 1 Details of the markers used in this study 

Position 



Marker 


Genetic (cM) 


Cytological 


No. of alleles 


H 


DROEXPAND 


0.7 


21 C4 


7 


0.47 


DROYANETSB 


5 


22D1-D2 


23 


0.80 


AC009392 


7 


23A-E 


8 


0.72 


DS01340 


10 


24A1-A2 


6 


0.50 


AC004373 


12 


24F1-F2 


9 


0.69 


AC004721 


17 


25 E6 


5 


0.37 


DROGPDHA 


18 


26 A 1 


3 


0.51 


AC004758 


18 


26A5-B5 


14 


0.78 


DRONINAC 


27 


28A1-A3 


4 


0.60 


AC004722 


28 


28C2-C4 


5 


0.48 


AC005555 


31 


29A1-C1 


9 


0.77 


AC005889 


35 


30A3-A6 


7 


0.64 


DMBIBGENE 


38 


30F 


7 


0.64 


DMU12269 


39 


31A1-A3 


21 


0.81 


DRODANS 


42 


31E 


2 


0.01 


AC005115 


44 


32D2-D4 


5 


0.43 


G410 


46 


33E9-E10 


18 


0.83 


AC006302 


48 


34C4-D2 


10 


0.67 


AC004118 


50 


35B2-B3 


20 


0.83 


Adh 


50 


35B3 


2 


0.35 


DRODORSAL 


53 


36C 


10 


0.56 



Markers in bold type are located within the breakpoints of ln{2L)t. H is 
gene diversity. 



at Innisfail (P = 0.014) and when all sites were pooled 
(P = 0.044), but there were no significant differences in 
allelic richness between chromosome arrangements at 
Coffs Harbour (P = 0.228) or Melbourne (P = 0.389). 
There were also no significant differences between sites 
within each chromosome arrangement (x 2 = 4.30, 
P = 0.116 and x 2 = 2.14, P = 0.344 for the inverted and 
standard chromosomes respectively). Gene diversity 
did not differ significantly between chromosome ar- 
rangements at any site (Innisfail: P = 0.117; Coffs 
Harbour: P = 0.410; Melbourne: P = 0.188) or between 
sites within standard chromosomes (x 2 = 3.76, P = 
0.152), but there were significant differences between 
sites in the inverted chromosomes (x 2 = 7.36, P = 
0.025). Pairwise tests indicated that the differences in 
gene diversity were between the Innisfail and Melbourne 
sites (P = 0.002). 

There was a significant heterozygosity excess at the 
Melbourne site in the inverted chromosomes (Wilcoxon 
test, P = 0.002), indicating a recent severe reduction in 
effective population size. None of the other site/ 
chromosome arrangement combinations had higher than 
expected heterozygosities (Wilcoxon P-values ranged 
from 0.830 to 0.997). 
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Figure 2 Relationship between latitude and frequency of ln{2L)t 
(A) and the Adh s allele (B) in populations of Drosophila 
melanogaster collected along the east coast of Australia. Solid 
circles are data from this study. Open circles are data taken from 
Umina et al. [28]. Error bars are standard errors. 
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Figure 3 Levels of genetic variation within ln{2L)t (black) and 
standard arrangement (open) chromosomes. Gene diversity (A) 
and allelic richness (B). Error bars are standard errors. 



Genetic differentiation within and between chromosome 
arrangements 

Nearly all markers within In(2L)t showed significant 
differentiation between the standard and inverted 
chromosomal arrangements at the Innisfail and Coffs 
Harbour sites (Figure 4). Significantly differentiated 
markers were also evident outside the inversion 
breakpoints (Figure 4), but they were less common 
and the divergences were lower (inside mean F ST = 
0.28 and 0.22, outside mean F ST = 0.12 and 0.13). 
Fewer markers were significantly differentiated at the 
Melbourne site, a likely consequence of the relatively 
small number of inverted chromosomes sampled at 
this site. Overall, patterns of differentiation between 
inverted and standard chromosomes were quite simi- 
lar across sites (Spearman's rank correlations between 
F ST values at different sites were significant, P < 0.05 
in all cases), but there were notable differences. For 
example, the region of highest differentiation between 
chromosome arrangements at Innisfail was in the 
middle of In{2L)t, but it was more evenly spread 
across the inversion at Coffs Harbour (Figure 4). 

In addition to differentiation between chromosome 
arrangements at each site, there were significant 
differences among sites within each chromosome 



arrangement (inversion: F ST = 0.070, P < 0.001; stand- 
ard F ST = 0.021, P < 0.001). However, the differences 
among sites within chromosome arrangements were 
small compared to the differences between inverted 
and standard chromosomes. AMOVA revealed that 
26.9% of the total genetic variation was among 
chromosome arrangements, while only 2.1% occurred 
among sites within chromosome arrangements. Levels 
of differentiation between pairs of sites within each 
chromosome arrangement increased as the geograph- 
ical distance between them increased (Figure 5). It 
was also apparent from non-overlapping 95% confi- 
dence limits (CLs) that genetic differentiation between 
sites was significantly higher within the inverted chro- 
mosomes than it was within standard chromosomes 
(Figure 5). 

The outlier analyses identified a single marker, Adh, 
with an excessively high F S t value compared to 
neutral expectations. It was an outlier in the compari- 
son between Innisfail and Melbourne (P = 0.002). 
Adh also had a high F ST value relative to other 
markers in the comparison between Coffs Harbour 
and Melbourne, but was non-significant after correc- 
ting for multiple comparisons using a false discovery 
rate of 10% (P = 0.023). 
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Figure 4 Marker divergences between chromosome 
arrangements at each site. Solid symbols represent markers with 
significant divergence after correction for multiple comparisons. 
Open symbols are not significantly different to zero. The crosses on 
the x-axis depict the locations of the ln(2L)t breakpoints. 



Linkage disequilibrium and genetic mixing 

Associations between marker alleles and In(2L)t were 
evident at all sites (Figure 6), though they tended to be 
higher in the low and middle latitude sites (mean r 2 = 
0.16 and 0.12 for Innisfail and Coffs Harbour respect- 
ively) than in the high latitude site at Melbourne (mean 
r 2 = 0.03). Nevertheless, associations between alleles and 
In(2L)t were consistent across sites. Spearman's rank 
correlations between r 2 values at different sites were 
highly significant (P < 0.002 in all cases) with r s values 
ranging from 0.48 to 0.84. 

Cluster analysis on the genetic markers situated within 
In(2L)t revealed a strong correspondence between mem- 
bership to a particular cluster and chromosome arrange- 
ment (Figure 7). In most cases, individual chromosomes 
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Figure 5 Relationship between genetic and geographical 
divergence. Solid circles are ln{2L)t chromosomes. Open circles are 
standard arrangement chromosomes. Error bars are 95% confidence 
limits calculated by bootstrapping over loci. 



were assigned completely to one cluster, with each 
chromosome arrangement being represented by a differ- 
ent cluster. However, there were exceptions. Some chro- 
mosomes were assigned to the cluster that represented 
the alternate chromosome arrangement. There were also 
several chromosomes with a significant proportion of 
membership to both clusters (Figure 7). 



Decay of linkage disequilibrium and genetic 
differentiation between chromosome arrangements 

Our simulations suggest that double-crossovers will 
homogenize allele frequencies and breakdown LD at loci 
in the middle of the inversion quickly provided the rate 
of gene exchange is high. When simulations were run 
using a rate of gene exchange calculated from map 
distances (0.0094 per generation), genetic differentiation 
between chromosome arrangements (subpopulations) 
declined sharply with F ST 's close to zero by generation 
800, even when the initial allele frequency differences 
were at a maximum (Figure 8). Declines in genetic diver- 
gence were also evident when the rate of gene exchange 
was set at 0.0001 per generation, but F S t's were still well 
above zero after 1400 generations (Figure 8). Similar 
patterns were observed when levels of gene diversity and 
allelic richness were compared between chromosome ar- 
rangements. When the level of gene exchange was set at 
the higher rate, differences in genetic diversity decreased 
rapidly to be close to zero by generation 800, irrespective 
of the starting divergence between chromosome arrange- 
ments. However, under the lower rate of gene exchange, 
significant differences in gene diversity and allelic richness 
(with higher levels of variation in the standard arrange- 
ment) were still present at generation 1400. When simula- 
tions were started using maximum divergence between 
chromosome arrangements (i.e. each chromosome ar- 
rangement was fixed for a different allele), differences were 
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either non-significant or there was higher genetic diversity 
in the inverted chromosomes. 

No significant LD between marker alleles and the in- 
version was found in the simulated data sets at gener- 
ation 1400 when the higher rate of gene exchange was 
used (mean r 2 = 0.02, much lower than the observed 
values in Figure 6). However, some LD was detected 
when the lower rate of gene exchange and maximum 
levels of genetic divergence between chromosome ar- 
rangements was used, with 82.6% (mean r 2 = 0.36) of al- 
leles showing a significant association with the inversion 
for simulations started at maximum genetic divergence 
between chromosome arrangements. This dropped to 
7.5% (mean r 2 = 0.05) when there was a moderate level 
of divergence. Therefore the simulations suggest that we 
observed more LD than expected in all but the most ex- 
treme case. Moreover, these estimates assume an effect- 
ive population size of 10 6 , likely to be an underestimate 
forD. melanogaster populations [29]. 

Discussion 

According to the models proposed by Dobzhansky [3] 
and Kirkpatrick and Barton [14], the selective value of 
inversions comes from their ability to hold together sets 
of locally adapted alleles. However they differ on 
whether epistatic interactions are necessary among the 
alleles, and also on whether there are interactions among 
alleles in inverted and non-inverted chromosomes. 
Moreover, while the Kirkpatrick and Barton [14] model 
focuses on the spread of an inversion in populations due 
to the combination of favourable alleles, Dobzhanskys 
verbal arguments focused on combinations of alleles that 
worked together within as well as between populations, 
and emphasized that different combinations might be 
favoured in different populations even when they were 
in the same chromosomal rearrangement. 

Both these models predict that the genetic content of 
inverted and standard arrangements should differ, and 
that differences may also develop within arrangements 
at different sites. Consistent with this, our data show 
there is strong differentiation between chromosome ar- 
rangements at all sites, particularly for some chromo- 
somal regions. They also show significant differentiation 
among In(2L)t chromosomes sampled from different 
sites along a latitudinal cline that is significantly greater 
than levels of differentiation observed at the same 
markers in standard chromosomes. While a small num- 
ber of allozyme studies have shown differences in allelic 
content of the same inversion among populations in 
Drosophila (e.g., [20,21,30]), they are characterized by 
low sample sizes and analyses involving only a few loci 
[13]. Due to the low frequency of In(2L)t at high lati- 
tudes, our high latitude site also had a low sample size. 
However, this is not the case for the low and mid 



latitude sites (n > 44 chromosomes), which also show 
significant differentiation between chromosome arrange- 
ments and between sites within the same chromosome 
arrangement. 

A similar pattern of differentiation between chromo- 
some arrangements that we have observed here has been 
found in other studies [23,25], whose authors have 
argued that such patterns arise through directional 
selection maintaining divergence between chromosome 
arrangements at specific loci in the face of genetic ex- 
change between them. In our case, where there has been 
a recent introduction, is a low incidence of recombin- 
ation adequate in explaining these patterns, without the 
need to invoke selection? The clustering analysis indi- 
cates that some inverted chromosomes had the allelic 
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Figure 6 Linkage disequilibrium between marker alleles and 

ln[2L)t. Solid circles are markers in significant LD with ln{2L)t. Open 
symbols are not significantly associated with ln{2L)t. The crosses on 
the x-axis depict the locations of the ln{2L)x breakpoints. 
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Figure 7 Summary of the clustering analysis assuming two admixed populations (k = 2). Each individual is represented by a bar showing 
the individual's estimated membership to a particular cluster. Black lines separate samples with different chromosome arrangements. 



content of standard chromosomes, suggesting an on- 
going process of recombination between the arrange- 
ments. It is interesting to note that these chromosomes 
were absent in the low latitude site, suggesting stronger 
selection against standard chromosome alleles in the 
inverted background at low latitudes. 

Recombination is also suggested by the absence of 
strong genetic differentiation between northern and 
southern Australian populations in the regions spanned 
by In(2L)t, even though there is a cline for this inversion 
[31]. Recombination by crossing over should be more 
effective away from inversion breakpoints that maintain 
a strong historical signature, which may include In(2L)t 
in Drosophila [32]. However, in our analysis there is no 
tendency for genetic differentiation to decrease towards 
the centre of this large inversion. Instead a combination 
of LD and selection across multiple loci along the 
inversion may explain this pattern, although the simula- 
tions suggest that we cannot entirely rule out initial 
colonization of Australia by individuals with inverted 
and standard arrangements fixed for different alleles 
and/or a lower than expected rate of gene exchange. 



Differences in genetic variation between chromosome 
arrangements may reflect the fact that the inversion is 
derived from the standard arrangement, although it is 
not likely that this event, estimated to be some time ago 
(~ 160,000 years, [32]), would have much ongoing effect 
on microsatellite variation. Alternatively, the differences 
may reflect a lower number of founders with the 
inverted arrangement at the time of colonization (more 
than 100 years ago, [33]). However, again, we would ex- 
pect this difference to break down due to recombination 
and gene conversion unless the effective size of a popu- 
lation is particularly low. Indeed our simulations show 
that differences in allelic richness and gene diversity be- 
tween chromosome arrangements would disappear rela- 
tively quickly, within the time since colonization, if the 
frequency of double crossovers in heterokaryotypes 
matches the level expected from map distances. The ex- 
tent of LD between markers and the inversion was also 
much lower in the simulated data than was observed, 
even when the level of gene exchange between arrange- 
ments was low, except when the starting divergences 
between arrangements were extreme. 
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The outlier analysis provided evidence of directional 
selection on the Adh polymorphism along the cline, 
but only in standard chromosomes. This result is not 
unexpected given the strong association between the 
Adh F allele and standard chromosomes. Based on 
data from previous studies and their own, Veuille et 
al. [34] reported only two inverted chromosomes with 
an Adh F allele in a list of 1002 chromosomes, al- 
though 49 were expected at random. In our survey of 
538 chromosomes, including 97 inversions (18.0%) 
and 205 Adh F alleles (37.7%), we found only two In 
(2L)t chromosomes with an Adh F allele. The propor- 
tion of In(2L)t and Adh F chromosomes in our study 
was slightly higher, but not significantly different 
(Yates Chi-square = 0.01, P = 0.914), to the propor- 
tion reported by Veuille et al. [34]. The strong associ- 
ation between standard chromosomes and Adh F has 
been attributed to both selection and historical pro- 
cesses. Based on molecular variation at Adh, Veuille 
et al. [34] suggests that the historical explanation is 
more likely, with the lack of In(2L)t and Adh F chro- 
mosomes due to recent contact between different 
haplotypes that evolved in isolation. 



Conclusions 

Genetic markers situated within the inverted regions 
showed high levels of differentiation between chromosome 
arrangements, despite the potential for recombination 
between chromosome arrangements as emphasized by the 
clustering analysis. This strong differentiation particularly 
away from inversion breakpoints suggests that patterns of 
variation are partly influenced by selection and LD. Simula- 
tions provide support for this view, but more accurate 
estimates of gene exchange between chromosome arrange- 
ments are required and we cannot entirely rule out strong 
differentiation of markers in the initial colonization of 
Australia. There was also some evidence for the develop- 
ment of population differentiation within arrangements 
and particularly the inverted arrangement. Levels of genetic 
variation were lower within the inverted arrangement, 
which may reflect founder events, but selection facilitated 
by LD is also suggested because there was no evidence of 
severe reductions in effective population size. Additional 
research on combinations of chromosomal regions should 
help indicate whether there are epistatic interactions 
among regions. 

Methods 

Fly collections 

Wild D. melanogaster were collected from three loca- 
tions on the east coast of Australia between January and 
June in 2008 (Figure 1). These sites represent ends of 
the latitudinal clines in In(2L)t and the Adh F/S poly- 
morphism documented in Knibb et al. [35] and 
Oakeshott et al.[27]. They included a low latitude site at 
Innisfail (17.51°S 146 00°E), a high latitude site at 
Melbourne (37.56°S 145 10°E) and a mid-latitude site at 
Coffs Harbour (30.14°S 153 49°E). The geographical dis- 
tance between these sites ranged from 1244 to 2230 km. 

From each site between 118 and 189 isofemale lines 
were established from individual field collected females. 
These lines were maintained at 25°C under continuous 
light on a sugar (1.6% w/v), agar (3.2%), yeast (3.2%) and 
potato (1.6%) medium that was always treated with an 
antifungal agent (0.14% nipagin) and antibiotics (2% 
dihydrostreptomycin and 0.6% penicillin added to the 
medium surface). After two to 12 generations in the la- 
boratory, virgin females from each isofemale line were 
crossed to males from the isogenic y[l]; cn[l] bw[l] sp 
[1] strain (FlyBase ID: FBst0002057). The progeny from 
these crosses were preserved in 100% ethanol and stored 
at -20°C for genotyping. 

Genotyping 

DNA extraction from individual flies, PCR protocols, 
and allele scoring followed methods outlined in Gockel 
et al. [36]. Each fly was genotyped at 20 microsatellite 
loci and molecular markers for the Adh and In(2L)t 
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polymorphisms. All molecular markers were located on 
the left arm of chromosome 2, close to or within the 
breakpoints of In(2L)t (Table 1). Primer sequences and 
information about microsatellite loci are provided in 
Additional file 1. Primer sequences and protocols for 
genotyping the Adh and In(2L)t polymorphisms are 
described in Umina et al [28] and Andolfatto et al. [32] 
respectively. A maximum of three Fx progeny (mean = 
1.2) were genotyped from each isofemale line, with a 
total of 154 flies from the Innisfail site, 179 flies from 
the Coffs Harbour site and 205 flies from the Melbourne 
site. Eight individual flies from the isogenic y[l]; cn[l] 
bw[l] sp[l] strain were also genotyped for each of the 
molecular markers. 

Data analysis 

A haploid, multilocus data set of known gametic phase 
was created by subtracting the allele present in the iso- 
genic y[l]; cn[l] bw[l] sp[l] strain (the sire) from the 
genotype of each F 2 individual at each genetic marker. 
The chromosome arrangement for each haplotype was 
also determined in the same way using the genotype of 
the In(2L)t genetic marker. These data were used for all 
subsequent analyses. 

For each site/chromosome arrangement combination, 
the level of genetic variation was quantified by calculat- 
ing allelic richness (a measure of the number of alleles 
independent of sample size) and gene diversity using the 
FSTAT software package [37]. Differences in genetic 
variation between chromosome arrangements at each 
site were tested using a Wilcoxon s signed-rank test (for 
paired comparisons between two groups) and differences 
among sites within each chromosome arrangement were 
tested using a Friedman's ANOVA (for multiple paired 
comparisons). Genetic differentiation between chromo- 
some arrangements and among sites within each 
chromosomal arrangement was assessed by calculating 
Weir & Cockerham s [38] estimator of F ST . Pairwise F S t 
values, 95% confidence limits for these values and tests 
for differentiation among sites were calculated with the 
FSTAT software package [37]. We used analysis of mo- 
lecular variance (AMOVA) to partition genetic variation 
between chromosome arrangements and among sites 
within chromosome arrangements. Linkage disequilib- 
rium between marker alleles and In(2L)t were quantified 
using the r 2 coefficient. Estimates of r 2 , the significance 
level of the disequilibrium and AMOVA were calculated 
with the ARLEQUIN version 3 software package [39] . 

Tests for selection acting on marker loci were carried 
out using the F ST outlier approach [40,41] and were 
performed with the LOSITAN software package [42]. 
The method involves evaluating the relationship between 
F S t and expected heterozygosity in an island model of 
migration with neutral markers. This distribution is used 



to identify excessively high or low F S t values compared 
to neutral expectations. Such outlier loci are candidates 
for being subject to selection. Simulations were run 
using 10 000 replications, 95% confidence intervals and 
the neutral and forced mean options. An infinite allele 
mutation model was assumed. Analyses using the step- 
wise mutation model were also carried out, but they pro- 
vided similar results, so only those with the infinite 
allele mutation model are presented. 

To assess the extent of genetic mixing between chromo- 
some arrangements at each site, a cluster analysis was 
performed using the program STRUCTURE, version 2.1 
[43]. We assumed the presence of two genetic clusters 
(k = 2) to represent each of the chromosome arrange- 
ments segregating in the populations. Chromosomes were 
assigned a membership coefficient, which is the propor- 
tion of the genome that is derived from a particular clus- 
ter. STRUCTURE was run with the admixture model and 
correlated allele frequencies [43,44]. Five independent 
runs were performed using 100000 iterations, with a 
burn-in period of 10000 iterations. 

Finally, tests for a severe reduction in effective popula- 
tion size (population bottleneck) were performed for 
each site/chromosome arrangement combination using 
the software package BOTTLENECK [45]. The method 
used was based on the principle that that the number of 
alleles decreases faster than expected heterozygosity 
after a bottleneck [46]. In this situation, expected hetero- 
zygosity should be higher than the equilibrium heterozy- 
gosity predicted in a stable population from the 
observed number of alleles. Following the authors' rec- 
ommendation for microsatellite data, we used a two- 
phase model (TPM) with 95% single-step mutation and 
5% multiple-step mutations (and a variance among mul- 
tiple steps of 12). A Wilcoxon signed rank test was run 
to determine whether each sample had a significant 
excess of heterozygosity. 

With the exception of the LD calculations and tests 
for loci under selection, analyses were performed using 
markers situated within the breakpoints of In(2L)t only. 
Because sampling multiple chromosomes from within 
isofemale lines may influence LD, only a single chromo- 
some from each isofemale line were used for the LD 
analysis. Corrections for multiple comparisons were 
applied to all tests. 

Decay of linkage disequilibrium and genetic 
differentiation between chromosome arrangements 

To determine whether there has been sufficient time for 
gene exchange to breakdown LD and genetic differenti- 
ation between chromosome arrangements in the middle 
part of In(2L)t in Australian populations, simulated data 
sets were created using a modified version of the 
EASYPOP 2.0.1 computer program [47]. The simulations 
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were based on a simple model consisting of two haploid 
subpopulations, representing each of the alternate chromo- 
some arrangements. These two subpopulations were 
allowed to exchange genes between each other at a rate 
equivalent to the expected frequency of double crossovers 
and gene conversions occurring within In(2L)t each gener- 
ation. Unfortunately there are no estimates of these pro- 
cesses for microsatellite markers, and two rates of gene 
exchange were used. The first was a rate of 9.4 x 10~ 4 per 
generation, which is equal to the probability of a crossover 
event occurring within a 5 cM region (1.2 - 3.4 Mb) inside 
each of the inversion breakpoints in heterokaryotypes 
(0.05 x 0.05 x 2q(l - q), where q is the frequency of 
In{2L)t). The second was a rate of 1.0 x 10~ 4 per gene- 
ration, which is an estimate of the rate of double cross- 
overs between segregating inversions in Drosophila based 
on phenotypic markers located near the centre and inver- 
sion breakpoints [48,49]. 

The total number of chromosomes (i.e. both subpopu- 
lations combined) was set at 2.0 x 10 6 (N e ~ 10 6 , [50]). 
However, the number of chromosomes in each subpopu- 
lation varied to reflect the different frequencies of stand- 
ard and In(2L)t chromosomes segregating in tropical 
populations (0.75 and 0.25 respectively). The mutation 
rate was set at 5.65 x 10~ 6 , which is the weighted 
average of microsatellite mutation rates observed in 
D. melanogaster [51,52]. Both a single step and two- 
phased models of mutation were used. However, because 
both mutation models gave qualitatively similar results, 
only the results with the single step mutation model are 
presented. 

The simulations ran for 1400 generations (the esti- 
mated number of generations D. melanogaster has been 
in Australia assuming 10 generations per year [33]) and 
were based on 10 loci with free recombination and a 
maximum of eight alleles per locus (the average number 
of alleles at each locus). Simulations were started with 
either maximum or intermediate (F ST ~ 0.3) allele fre- 
quency differences between subpopulations. In models 
with maximum genetic divergence, subpopulations were 
fixed for different alleles at each locus. Subpopulations 
with intermediate genetic divergences were set up by 
randomly selecting alleles from all possible allelic states 
for one subpopulation and randomly selecting alleles 
from a subset of possible allelic states for the other sub- 
population. This also resulted in a difference in allelic 
richness between subpopulations, which is likely in nat- 
ural populations given the smaller N e and more recent 
origin of inverted chromosomes. 

Each generation 44 chromosomes (the minimum 
sample size from mid and low latitude sites in our 
study) were sampled from each subpopulation and 
used to calculate the level of divergence between the 
subpopulations. Simulated data sets from generation 



1400 were also analyzed with ARLEQUIN to assess 
levels of LD. 

Additional file 



Additional file 1: Table SI. Repeat type and primer sequences for 
microsatellite loci. 
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